Cascaded Tuning to Amplitude Modulation for Natural Sound Recognition
Similar resources
Cascaded Amplitude Modulations in Sound Texture Perception
Sound textures, such as crackling fire or chirping crickets, represent a broad class of sounds defined by their homogeneous temporal structure. It has been suggested that the perception of texture is mediated by time-averaged summary statistics measured from early auditory representations. In this study, we investigated the perception of sound textures that contain rhythmic structure, specifica...
Neural modulation tuning characteristics scale to efficiently encode natural sound statistics.
The efficient-coding hypothesis asserts that neural and perceptual sensitivity evolved to faithfully represent biologically relevant sensory signals. Here we characterized the spectrotemporal modulation statistics of several natural sound ensembles and examined how neurons encode these statistics in the central nucleus of the inferior colliculus (CNIC) of cats. We report that modulation-tuning ...
Amplitude Modulation Maps for Robust Speech Recognition
Two recognition tasks are discussed in which pre-processing based on amplitude modulation (AM) maps is compared with other feature extraction strategies. In the first task we show how the AM map representation can be used to segregate voiced speech signals from one another. The second shows how the AM representation can be used for robust digit recognition in additive noise. Natural vowels from...
Speech Emotion Recognition Using Amplitude Modulation Parameters
Researchers in the Human Computer Interface (HCI) community have been working for several years to emulate a human communication system, using innovative technologies and methodologies based on emotion recognition in facial expressions and speech [1-3]. Speech emotion recognition (SER) [4] is a challenging framework in demanding human machine interaction systems. Standard appr...
Amplitude modulation features for emotion recognition from speech
The goal of speech emotion recognition (SER) is to identify the emotional or physical state of a human being from his or her voice. One of the most important things in a SER task is to extract and select relevant speech features with which most emotions could be recognized. In this paper, we present a smoothed nonlinear energy operator (SNEO)-based amplitude modulation cepstral coefficients (AM...
Journal
Journal title: The Journal of Neuroscience
Year: 2019
ISSN: 0270-6474,1529-2401
DOI: 10.1523/jneurosci.2914-18.2019